Non-Gaussian sampling effects on the CMB power spectrum estimation 
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We introduce a non-Gaussian model of structure formation, and show how it produces surprising 
sampling effects on the estimation of the CMB power spectrum. Estimates of the average power 
spectrum based on small sky patches produce qualitatively different results from patch to patch, 
and vary also with the patch size. For instance a system of secondary Doppler peaks is observed 
in small enough patches, but at different positions in different patches. However, no secondary 
Doppler peaks are observed in large sky patches. These effects question usual methods of power 
spectrum estimation, and techniques like bootstrapping. The non-Gaussianity of this model fails to 
be detected by standard tests. 

PACS Numbers : 98.80. Cq, 95.35+d 

Considerable effort is currently being put into the es- 
timation of the power spectrum Cf of CMB tempera- 
ture fluctuations The Q are expected to provide 
a wealth of information on theories of the early Uni- 
verse, discriminating between paradigms , and allow- 
ing highly accurate measurements of cosmological param- 
eters 1^,0]. The assumption that the temperature fluctu- 
ations constitute a Gaussian random field plays a central 
role in nearly all data-analysis methods (see however [^). 
This assumption has been checked in some data sets (eg. 
lH), although one can argue that the tests applied may 
not be conclusive |lO|. One may certainly devise subtle 
non-Gaussian theories which pass the traditional Gaus- 
sian! ty tests. 

Although non-Gaussianity cannot show up at the level 
of the average power spectrum C^, it may affect its esti- 
mation from a single sky. Larger Cg cosmic variance er- 
ror bars, and correlations between all-sky Cg are familiar 
non-Gaussian effects known to be present in some mod- 
els [0. These effects would certainly affect comparison 
between theoretical and experimental Cg. Correlations 
between Ci would allow more structure to be present in 
each observed Cg spectrum than in the average Ci. Also 
one would have to revise the allocation of error bars to ex- 
perimental Ci estimates, since these errorbars normally 
include cosmic variance. 

More interesting still is the possibility that non- 
Gaussianity might affect in a similar way sampling statis- 
tics, that is, cosmic variance in sky patches and the coor- 
dination of estimates derived from different sky patches. 
It could for instance happen that estimates of the aver- 
age power spectrum based on small sky patches produce 
different results from patch to patch. New results again 
could be obtained by changing the size of the patch. 

To give a motivated example, take a defect theory 
known not to display secondary Doppler peaks It 
could happen that power spectrum estimates using small 
patches of the sky revealed a structure of secondary 
Doppler peaks, but that different patches saw peaks at 
different positions. The Cg average could then display 
no secondary Doppler peaks, and it is this average that 
would be sampled by a large enough patch of the sky. We 
will see that this non-Gaussian effect may be achieved 



with a coherent |13| non-Gaussian source. For large sky 
patches such a source mimics the incoherent phenom- 
ena studied in |12|. There is therefore a loophole in the 
argument connecting absence of secondary peaks and in- 
coherence, if one allows for non-Gaussianity. 

As we shall see, it is possible to construct theories 
which display this effect while still passing the tradi- 
tional Gaussian tests. This is somewhat disturbing, as 
for signals coming from such theories the usual meth- 
ods for estimating the power spectrum would be grossly 
wrong, and in any case miss the theory's rich structure 
of spectra. Techniques like bootstrapping would not be 
appropriate. 

Here we present a simple model with all the proper- 
ties mentioned above. We regard it as a valid model of 
structure formation deserving attention in its own right. 
However we reserve for a longer publication a pre- 
sentation of the details of the model, and concentrate 
here on its unusual sampling effects. The idea is that 
structure is due to seeds in the early Universe, which act 
as a separate inhomogeneous component. All we know 
about them is that their formation and evolution satis- 
fies energy conservation and that their power spectrum 
respects the causality constraints . In this sense these 
seeds are like the seeds in the isocurvature models of Q . 
They represent an ad-hoc construction led merely by the 
guiding principles of causality and energy conservation. 

However we postulate some dynamics. We require 
these seeds to be explosions, as in the mimic models dis- 
cussed in 1 17 1^. They differ from mimic explosions only 
in that they do not explode at the Big Bang, but at 
a given blast time ry^. When they explode they start 
seeding structure. Before the explosion they merely lurk 
about imparting no gra vitational action on the surround- 
ing matter. As in |l7||l^ ] we require these seeds to be 
scale- invariant . 

A final but central ingredient of the model is that the 
blast time r^f, is random, with a probability distribution 
P{rih), and is uncorrelated beyond a given distance ^. 
This is the only qualitative novelty of our model, but this 
feature will be responsible for all the weird non-Gaussian 
behaviour to be derived. In summary structure is seeded 
by bombs which have always been around, but which re- 



1 



main dormant for part of their lives, then explode, not at 
a pre-set time, but whenever it pleases them. Do not con- 
fuse this scenario with the explosive scenarios proposed 
in po|] . Our explosions satisfy energy conservation sep- 
arately, and only interact with the rest of the Universe 
gravitationally. 

More concretely we define our bombs by means of their 
stresses. If is their stress energy tensor, then we have 
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0* may then be Fourier transformed. We choose G'* 
and = for rj < rjb, but 
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for r] > r]h. For dcfiniteness we take A — 1. Switching on 
stresses rather than energy ensures that we do not violate 
energy conservation. It is also reminiscent of the switch- 
ing on of topological defects during a phase transition 
|19| . The other components of the stress energy tensor 
are determined by the energy conservation equations. 

The above specifies each individual explosion. If we set 
up a Poisson process of such explosions (with fixed rji,) 
then the resulting field power spectrum equals the density 
of explosions times the square of the Fourier transform of 
each individual explosion [ p^ . If furthermore the density 
of explosions is very large we have a Gaussian random 
field within each correlated region. We assume that the 
source is coherent within each correlated region. The 
calculation of the C'i conditional to a given r]b can then 
be made in the usual way |]l7| , |l8| . 

We finally choose a distribution of blast times lifted 
from radioactive decay: 
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where ti, is the average blast time. We assume that the 
spatial correlation between the blast times rji, dies off at 
a scale ^ which when projected on the last scattering 
surface obeys ^ ^ 47r (in radians). 

The CMB anisotropies produced by such a theory ex- 
hibit a rich structure of power spectra. For patches of sky 
with a size L smaller than ^ the system is best described 
by noting that the temperature probability distribution 
conditional to a given i]b is Gaussian. The observed power 
spectrum Ci conditional to rji, is the usual distribu- 
tion. We define the conditional average power spectrum 
as: 



Ciirib) = J dCiCiP{Ci\r]b) 
The sample variance is the usual 
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where Ni — [21 + l)/sky is the total number of modes 
contributing to the estimate, with /sky the fraction of 




FIG. 1. The spectrum of angular power spectra Ci{rjt,) 
which would be sampled in small sky patches. We only show 
spectra with a non-negligible probability, for Tb < r/*, where 
rjt is the conformal time at last scattering. 



sky covered by the patch. Conditional to a given rjh there 
are no correlations between the Ci (other than the ones 
mentioned in ||2l[] , which we assume have been bypassed) . 
This is the simplest description for patches with L < ^: 
"a spectrum of spectra" Cg{r]b), and the distribution of 
the parameter rji,. In other words, the statistical ensem- 
ble is best described in terms of a set of Gaussian sub- 
ensembles, with a power spectrum indexed by a variable 
rjb. From the point of view of the overall ensemble rji, is 
itself a random variable, with a distribution P{rjb). In 
Fig. |l| we show the various power spectra Ci{rib) which 
any estimate derived from small patches would produce. 

Applying the traditional description, in terms of an av- 
erage plus a sample variance error bar cr^(C'f), is a bad 
idea for patches with L < ^. Although the temperature 
distribution conditional to rjb is Gaussian, the marginal 
distribution is not. Non-Gaussian effects on the power 
spectrum therefore emerge. The marginal average power 
spectrum is 

= {Ce) = J dCidf]bCeP{Ci,iib) (5) 
Using P{Ci,T]b) = P{Ci\T]b)P{vb) leads to 

Ce = f dj^bPiVb)Ce{vb) (6) 



The sample variance is now 

'^'{Ci) = (1 + / dvbC^ivb)PiVb) - C! (7) 

always larger than the Gaussian sample variance 2C'j/Ni 
by virtue of the Schwarz inequality. If the distribution 
P{rib) is reasonably peaked and Ce{r]b)Cg (rjb) <^ C/{r]b) 
we get the "error propagation formula" : 
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FIG. 2. The marginal average power spectrum Cf. This is a 
bad description for observations on smaU patches with L < ^. 
It does however represents the power spectrum sampled by 
large patches. 



Even if Ni = oo, there would be a residual, purely para- 
metric, cosmic variance: 



There are also correlations between Ce 



(9) 



cov(C',,QO - J diibCt{fib)Ctijib)P(jlb) ~ CiCt (10) 

Larger sample variance error bars and correlations be- 
tween Ci ( "cosmic covariance" ) no doubt complicate 
comparison between theory and experiment. One is bet- 
ter off without them, which in this case means abandon- 
ing the Ci description, and taking up instead Ci{r]b) and 

In Fig. g we plotted the average power spectrum Cg. 
Clearly the larger sample variance error bars merely hint 
at the fact that the average power spectrum Ci fails to 
describe what would be observed in each small patch of 
the sky with L < What is seen in any patch is one 
of the Ce{r]b) spectra with Gaussian fluctuations about 
it, not the Ci spectrum with a larger random scattering 
about it. 

If we choose a patch with i ^ ^ then the situation is 
different. Power spectrum estimates based on the whole 
patch amount to an average of the estimates that would 
be supplied by each correlated region with size ^. Let us 
index these regions by i and let there be iV = (L/S,)'^ of 
them, so that Ce =Y1 CyN. The statistics of C} are the 
same as above with Ni = NNl, and so we have 
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FIG. 3. Measured power spectra in a patch 4° across, and 
in a patch 20° across. A maximum likelihood estimate of 
the average power spectrum based on small patches would 
reveal a system of secondary peaks, at different positions in 
different patches. The same exercise in a large patch would 
suggest absence of secondary peaks. 



and 



cov(Ci,Q') = — 



(12) 



If the distribution P{rjb) is reasonably peaked we get 



(11) 



We see that the excess sample variance, and the corre- 
lations between get smaller as the number of uncor- 
related regions in the patch N increases. As iV — > oo 
the residual parametric cosmic variance mentioned above 
also disappears. Essentially what happened is that for 
large patches the average power spectrum Ci has become 
representative of the observed Ci, . Observed Ce for large 
patches do scatter around a curve as in Fig. ||. 

We illustrate this phenomenon further in Fig. ^. We 
chose ^ = 5°, generated maps [|4j, and found the mea- 
sured power spectrum in a square patch 4° across, and in 
a patch with 20°. Secondary Doppler peaks are observed 
in each realization for small sky patches. However the 
peak's positions vary from patch to patch, or from real- 
ization to realization. Hence the average over realizations 
does not exhibit secondary Doppler peaks. Similarly es- 
timates based on large patches do not show secondary 
Doppler peaks. This effect was found by Turok in cosmic 
string simulations |22|. 
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FIG. 4. Histograms of skewness 73 and kurtosis 74 as es- 
timated from 25600 pixels for a Gaussian theory (line) and 
our theory (dash). These histograms have used 10000 realiza- 
tions. 



The theory we have just proposed is somewhat distress- 
ing. The non-Gaussianity of its maps fails to be detected 
by the traditional tests, such as skewness and kurtosis, 
density of peaks above a given height, genus number and 
the 3-point correlation function. We show how this is 
the case with the skewness 73 and the kurtosis 74. In 
Fig.^ we plot histograms of 73 and 74 as estimated from 
25600 pixels for 10000 realizations. Plots with dashes 
correspond to our theory, with lines to a Gaussian theory 
with the same average spectrum. As far as skewness 
is concerned the two theories are essentially the same. 
Our theory has a slightly positive kurtosis, but not be- 
yond what could be explained from cosmic variance for 
a Gaussian theory. Even this latter effect can be elimi- 
nated by changing the distribution P{rib) so as to ensure 
that the most likely power spectra Ci{rjb) have the same 
/ Mi'^Ci. This amounts to reducing the probability of 
patches with rjh « 0. 

Despite its apparent Gaussianity, the theory proposed 
shows a sampling effect on the power spectrum that 
invalidates traditional estimation methods. In particu- 
lar bootstrapping becomes very misleading. One should 



rather aim at independent power spectrum estimates for 
all possible combinations of patch location and size. The 
outcome would be Ce{rib), Pivt), and A standard esti- 
mation of the power spectrum would be highly misrepre- 
sentative of the actual structure of spectra of the theory. 

There is an irritating inconsistency between power 
spectrum estimates produced by different experiments, 
using different patches of the sky, or patches with differ- 
ent sizes Although these may be due to imperfect 
foreground subtraction, or calibration errors, it is curi- 
ous to point out that such inconsistencies might not be 
inconsistencies at all. They may be telling us about the 
unusual power spectrum structure non-Gaussianity may 
induce. 

Finally our theory shows that maybe we should be 
more careful when ruling out defects on the grounds of 
unflattering average power spectra Ce 1 24 - 26| . We know 
very little about non-Gaussianity in these models, but 
nothing tells us that comparison between theoretical and 
experimental Ce should be made in the same way as for 
Gaussian theories. 
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